gt data
Mitigating Bad Ground Truth in Supervised Machine Learning based Crop Classification: A Multi-Level Framework with Sentinel-2 Images
A, Sanayya, Shetty, Amoolya, Sharma, Abhijeet, Ravichandran, Venkatesh, Gosuvarapalli, Masthan Wali, Jain, Sarthak, Nanjundiah, Priyamvada, Dutta, Ujjal Kr, Sharma, Divya
In agricultural management, precise Ground Truth (GT) data is crucial for accurate Machine Learning (ML) based crop classification. Yet, issues like crop mislabeling and incorrect land identification are common. We propose a multi-level GT cleaning framework while utilizing multi-temporal Sentinel-2 data to address these issues. Specifically, this framework utilizes generating embeddings for farmland, clustering similar crop profiles, and identification of outliers indicating GT errors. We validated clusters with False Colour Composite (FCC) checks and used distance-based metrics to scale and automate this verification process. The importance of cleaning the GT data became apparent when the models were trained on the clean and unclean data. For instance, when we trained a Random Forest model with the clean GT data, we achieved upto 70\% absolute percentage points higher for the F1 score metric. This approach advances crop classification methodologies, with potential for applications towards improving loan underwriting and agricultural decision-making.
- Banking & Finance (0.90)
- Food & Agriculture > Agriculture (0.68)
A System-driven Automatic Ground Truth Generation Method for DL Inner-City Driving Corridor Detectors
Ruthardt, Jona, Michalke, Thomas
Data-driven perception approaches are well-established in automated driving systems. In many fields even super-human performance is reached. Unlike prediction and planning approaches, mainly supervised learning algorithms are used for the perception domain. Therefore, a major remaining challenge is the efficient generation of ground truth data. As perception modules are positioned close to the sensor, they typically run on raw sensor data of high bandwidth. Due to that, the generation of ground truth labels typically causes a significant manual effort, which leads to high costs for the labelling itself and the necessary quality control. In this contribution, we propose an automatic labeling approach for semantic segmentation of the drivable ego corridor that reduces the manual effort by a factor of 150 and more. The proposed holistic approach could be used in an automated data loop, allowing a continuous improvement of the depending perception modules.
Trade Crypto with#NoCode Machine Learning Based On Google Trends
Google trends (GT) is an under-utilized superweapon and harvests a massive amount of search data. But, it hasn't been possible to use GT for real time machine learning tasks, such as predicting stock price or crypto currency movements, until now....In this blog, we'll explain the problem with GT for machine learning, the fix to GT data and the edge we've built in crypto trading models at edgebase.io.We are currently looking for experienced crypto traders as beta testers for our product - please reach out to hello@edgebase.io! Edgebase.io is a no-code platform for building your own AI trading signals (initially cryptos only).
Towards the Use of Neural Networks for Influenza Prediction at Multiple Spatial Resolutions
Aiken, Emily L., Nguyen, Andre T., Santillana, Mauricio
We introduce the use of a Gated Recurrent Unit (GRU) for influenza prediction at the state- and city-level in the US, and experiment with the inclusion of real-time flu-related Internet search data. We find that a GRU has lower prediction error than current state-of-the-art methods for data-driven influenza prediction at time horizons of over two weeks. In contrast with other machine learning approaches, the inclusion of real-time Internet search data does not improve GRU predictions.
- North America > United States > Pennsylvania (0.05)
- Oceania > New Zealand (0.04)
- Oceania > Australia (0.04)
- (4 more...)